METHOD AND APPARATUS FOR STORYTELLING WITH DIGITAL
PHOTOGRAPHS 

FIELD OF THE INVENTION 

The present invention relates generally to the field of image retrieval and
organization. More specifically, the present invention is directed to multimedia
creation using digital objects (e.g., images).

BACKGROUND 

Photographs play a central role in many types of informal storytelling.
One of the most common and enjoyable uses for photographs is to share stories
about experiences, travels, friends and family. Almost everyone has experience
with this form of storytelling, which ranges from the exchange of personal
reminiscences to family and cultural histories. The World Wide Web can
facilitate the sharing of such stories in digital form and has inspired a
movement towards "digital storytelling." Stories in digital form are referred to
herein as digital stories. Digital photographs have an advantage over print
photographs in that users can search for and retrieve them both by their content
(e.g., features such as color and texture) and by their metadata (e.g.,
user-supplied text annotations). Today, most digital stories are created by people
with computer skills using special-purpose software for editing images and
authoring Web pages. Furthermore, these digital stories are created on systems
relying on graphical interfaces dependent on windows, or tables, and some
type of pointing or cursor control device. Such systems are typically not
portable and are complex to use. These features make these systems difficult to
share. These systems also require significant training and are inadequate for
mass consumer use.

SUMMARY OF THE INVENTION



A method and system that combines capabilities for storing, authoring, 
and viewing various forms of digital media are described. In one embodiment, 
a visual interface having three areas is provided. The first area displays three 
tracks of images. One track displays images that are stored on the device. A 
second track displays images of authored stories. The third track displays one 
or more images associated with a story currently being authored on the device. 
Control buttons are available to navigate among the three tracks and among the 
images on each track. 

The second area of the visual interface displays a larger version of an 
image corresponding to a thumbnail image selected in any of the three tracks in 
the first area. 

The third area of the visual interface displays a representation of one or 
more audio clips and other information associated with the image being 
displayed in the second area of the visual interface. 

BRIEF DESCRIPTION OF THE DRAWINGS



The present invention is illustrated by way of example in the following 
drawings in which like references indicate similar elements. The following 
drawings disclose various embodiments of the present invention for purposes 
of illustration only and are not intended to limit the scope of the invention. 

Figure 1 illustrates an exemplary embodiment of the display screen of 
one embodiment of a digital story-telling system. 

Figure 2 illustrates an exemplary embodiment of control buttons of the 
digital story-telling system. 

Figure 3 illustrates an exemplary expand/collapse view of a story.

Figure 4 illustrates an exemplary XML metadata file that the system 
reads upon initialization of the system. 

Figure 5A illustrates an exemplary XML metadata file for a story.

Figure 5B illustrates another exemplary XML metadata file for a story. 

Figure 6 illustrates an exemplary XML metadata file for a single 
photograph. 

DETAILED DESCRIPTION

A multimedia story creation and playback system is disclosed. The
following detailed description sets forth numerous specific details to provide a
thorough understanding of the invention. However, those of ordinary skill in
the art will appreciate that the invention may be practiced without these
specific details. In other instances, well-known methods, procedures, protocols,
components, algorithms, and circuits have not been described in detail so as not
to obscure the invention.

Some portions of the detailed descriptions that follow are presented in
terms of algorithms and symbolic representations of operations on data bits
within a computer memory. These algorithmic descriptions and
representations are the means used by those skilled in the data processing arts
to most effectively convey the substance of their work to others skilled in the
art. An algorithm is here, and generally, conceived to be a self-consistent
sequence of steps leading to a desired result. The steps are those requiring
physical manipulations of physical quantities. Usually, though not necessarily,
these quantities take the form of electrical or magnetic signals capable of being
stored, transferred, combined, compared, and otherwise manipulated. It has
proven convenient at times, principally for reasons of common usage, to refer
to these signals as bits, values, elements, symbols, characters, terms, numbers,
or the like.

It should be borne in mind, however, that all of these and similar terms
are to be associated with the appropriate physical quantities and are merely
convenient labels applied to these quantities. Unless specifically stated
otherwise as apparent from the following discussion, it is appreciated that
throughout the description, discussions utilizing terms such as "processing" or
"computing" or "calculating" or "determining" or "displaying" or the like, refer
to the action and processes of a computer system, or similar electronic
computing device, that manipulates and transforms data represented as
physical (electronic) quantities within the computer system's registers and
memories into other data similarly represented as physical quantities within the
computer system memories or registers or other such information storage,
transmission or display devices.

The present invention also relates to apparatus for performing the
operations herein. This apparatus may be specially constructed for the required
purposes, or it may comprise a general purpose computer selectively activated
or reconfigured by a computer program stored in the computer. Such a
computer program may be stored in a computer readable storage medium,
such as, but is not limited to, any type of disk including floppy disks, optical
disks, CD-ROMs, and magnetic-optical disks, read-only memories (ROMs),
random access memories (RAMs), EPROMs, EEPROMs, magnetic or optical
cards, or any type of media suitable for storing electronic instructions, and each
coupled to a computer system bus.

The algorithms and displays presented herein are not inherently related
to any particular computer or other apparatus. Various general-purpose
systems may be used with programs in accordance with the teachings herein, or
it may prove convenient to construct more specialized apparatus to perform the
required method steps. The required structure for a variety of these systems
will appear from the description below. In addition, the present invention is
not described with reference to any particular programming language. It will
be appreciated that a variety of programming languages may be used to
implement the teachings of the invention as described herein.

Overview

A multimedia storytelling system provides a user with capabilities to
share digital photographs and stories. In one embodiment, the system allows
the user to seamlessly switch between browsing, viewing, authoring, and
playing back the photographs and/or stories. In one embodiment, a
multimedia storytelling system includes three components: a storage
component, a display component, and an interaction component. The storage
component stores digital media objects which are displayed by the display
component. The system provides the interaction component (e.g., control
buttons) that allows a user to navigate the digital media objects and create
stories, or playlists. For purposes herein, a story, or a "playlist", is an ordered
collection of digital media objects, such as, for example, images or video clips,
with one or more narration tracks. Imported stories on the top track may be
merely a single photograph with or without a narration track. Photographs
may be imported in batches, similar to a roll of film. Similarly, a
collection may comprise the images of multiple pages in a single document.
Thus, although there may not be a narration track, the imported collection of
photographs or single photograph may represent a story for purposes herein.

In one embodiment, the digital media stored in the system includes 
photographs, images, audio and video clips. The digital media may include 
raw objects, such as, for example individual photographs, as well as authored 
objects that combine multiple forms of media into integrated stories. An 
example of an authored object is a sequence of photographs with an audio, or 
narration, clip associated with or accompanying the photographs. In one 
embodiment, the time of capture for an object is known and is stored along 
with the object in the system. 
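
The relationship between raw objects, narrations, and authored stories described above can be sketched as a small data model. This is a minimal illustration in Python, not the storage format of the system; the names MediaObject, Narration, StoryEntry, Story, and narrations_for are hypothetical and do not appear in the description.

```python
from dataclasses import dataclass, field
from datetime import datetime
from typing import List, Optional


@dataclass(frozen=True)
class MediaObject:
    """A raw object such as a photograph or video clip (hypothetical model)."""
    path: str
    captured: datetime  # time of capture, stored along with the object


@dataclass(frozen=True)
class Narration:
    """A narration: recorded audio, digital ink, or typed notes."""
    kind: str    # e.g. "audio", "ink", "text"
    source: str  # file name of the clip, or the note text itself


@dataclass
class StoryEntry:
    """One position in a story: an object plus an optional narration."""
    media: MediaObject
    narration: Optional[Narration] = None


@dataclass
class Story:
    """An ordered collection of media objects, i.e. a playlist."""
    entries: List[StoryEntry] = field(default_factory=list)


def narrations_for(media: MediaObject, stories: List[Story]) -> List[Narration]:
    """Collect every narration attached to one object across all stories."""
    return [
        entry.narration
        for story in stories
        for entry in story.entries
        if entry.media == media and entry.narration is not None
    ]


photo = MediaObject("beach.jpg", datetime(2000, 7, 4, 15, 30))
trip = Story([StoryEntry(photo, Narration("audio", "trip.wav"))])
family = Story([StoryEntry(photo, Narration("text", "First swim"))])
raw_roll = Story([StoryEntry(photo)])  # imported story without narration
```

Keeping the narration on the story entry rather than on the photograph is one way to let the same photograph carry a different narration in each story that uses it.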

New objects may be input into the system automatically without 
requiring any action from the user. New objects are input into the system from
one or more sources, such as, for example, flash memory cards containing
digital photographs, a video capture camera, floppy or CD-ROM drives holding
digital photographs or video clips, network (e.g., Web) downloads, etc. In one
embodiment, the user inserts a flash memory card into a slot in the
system and the photographs from the flash memory card are automatically
copied and stored in the system. In another embodiment, the system is able to 

074451.P108



receive email messages with attached photographs and stories. The 
photographs and/or stories are then automatically accessible. The various
ways in which multimedia objects may be input into the system would be
apparent to one skilled in the art.

A narration may take a variety of forms, such as, for example, recorded
audio, digital ink, or typed notes. In one embodiment, a narration track may
apply to a single object or to a number of objects (e.g., a narration track for a
sequence of images). There may be multiple narration tracks for a single object.
That is, the same digital object (photograph) may have several associated
narrations.

The display component of the system provides multiple tracks
displaying digital media objects to the user. One track displays digital media
objects that are stored in the system. A second track displays digital media
objects that have been integrated into authored stories. The third track displays
one or more digital media objects associated with a story currently being
authored using the system.

The display component of the system also allows the user to navigate
through the objects in storage. This may include being able to browse through
raw objects (e.g., objects without narrations associated or attached thereto),
browse through stories, and play back previously created stories. In one
embodiment, playing back a story involves showing the user a slideshow of
images along with any accompanying narrations.

The interaction component of the system allows the user to create new
stories. In one embodiment, the user may create new stories using one or more 
of several approaches. For example, a user may author a story by starting with 
an empty or blank story, selecting objects to add to the story, and adding 
narrations to individual objects or a range of objects. The selection of objects 
and the addition of narrations may be performed in any order. A user may 
author a new story by performing edit or delete operations on an existing story. 

In one embodiment, the system has a modeless interface, giving the
users an interface of a small number of buttons and no pointing device or
touchscreen. In another embodiment, the system may be implemented on a
typical desktop computer or on a standalone appliance. The user controls for
such a system may utilize standard input devices, such as, for example, a
mouse, a keyboard, a touchscreen, a touch pad, or a physical control
environment of buttons and/or sliders designed specifically for the interaction
described herein.

Figure 1 illustrates one embodiment of a display screen and interface of a 
system. Media objects may be loaded onto the system from various sources, 
such as, for example, digital cameras, digital video capture, microphones, 
scanners, or may be created on the system itself. These objects are then stored 
in the system on a storage medium (e.g., random access memory, a hard drive, 
flash memory). 

As illustrated in Figure 1, the system provides a visual interface 100
which splits the screen into three general areas. In one embodiment, the first
area 102 is at the top of the screen, the second, or central, area 103 is at the
bottom right side of the screen, and the third area 104 is at the bottom left
side of the screen.

The first area 102 provides a graphical representation for browsing and 
navigating through media objects (e.g., photographs, etc.). In one embodiment, 
there are three horizontal tracks of thumbnail images, tracks 105, 110 and 115, 
each of which can be navigated by scrolling. 

In one embodiment, the top track 105 shows images of existing 
photographs ordered by time of creation (or other such features). The time of 
creation may be indicated by time stamps. These may be grouped into "rolls of 
film" or other natural clusters. In the case of scanned prints, imported stories
may correspond to literal rolls of film. In the case of digital photographs, the 
photographs may correspond to a set of photographs downloaded from the 
camera in one session. The top track 105 may also display all or many existing 
photographs currently stored in the system. That is, any image stored on the 
system may be displayed. The photographs displayed in the top track 105 may 
be stories. For example, one or more of the photographs in the top track 105 
may be an imported story. 

In one embodiment, the imported stories may be generated 
automatically as a result of a database query. For example, a database query for 
"all images stored on a device containing blue" returns images having the
"blue" identifier. The resulting images are then imported into the system.
Note that there is no requirement that the first track 105 include any stories at
all. That is, the first track 105 may contain zero or more stories, each of which
has one or more images.
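
As a rough illustration of generating an imported story from a query, the sketch below filters a stored collection by identifier. It assumes, purely for illustration, that each image carries a set of text identifiers; the function name query_images and the sample file names are hypothetical, and a content-based query would need image analysis that is not shown.

```python
from typing import Dict, List, Set


def query_images(index: Dict[str, Set[str]], identifier: str) -> List[str]:
    """Return the names of all stored images carrying the given identifier."""
    return sorted(name for name, tags in index.items() if identifier in tags)


# Hypothetical stored images and their identifiers.
index = {
    "sky.jpg": {"blue", "outdoor"},
    "barn.jpg": {"red"},
    "lake.jpg": {"blue"},
}

# The query result becomes a single imported story on the first track.
imported_story = query_images(index, "blue")
```

A query that matches nothing returns an empty list, which is consistent with the first track containing zero or more stories.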

Within an imported story, the photographs are ordered chronologically. 
Photographs from digital cameras may be ordered according to when they 
were taken, while images scanned from print photographs are ordered by 
scanning time. In one embodiment, in order to distinguish separate stories, the
system uses alternating background colors, with each color corresponding to
a different story.
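
The ordering and coloring rules above can be sketched as follows. This is an illustrative Python fragment only; the function name layout_track, the two-color palette, and the use of plain numeric timestamps are assumptions made for the example.

```python
from typing import List, Sequence, Tuple


def layout_track(stories: Sequence[Sequence[Tuple[str, float]]],
                 palette: Sequence[str] = ("gray", "white")) -> List[Tuple[str, str]]:
    """Return (image, background) pairs for one track.

    Each story is a sequence of (image name, timestamp) pairs. Images are
    sorted by timestamp inside a story, and the background color alternates
    from one story to the next so that neighboring stories stay distinct.
    """
    cells = []
    for number, story in enumerate(stories):
        background = palette[number % len(palette)]
        for name, _stamp in sorted(story, key=lambda item: item[1]):
            cells.append((name, background))
    return cells
```

For a digital camera the timestamp would be the capture time; for a scanned print it would be the scanning time, as described above.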

The middle, or second, track 110 contains authored stories, each 
including a sequence of one or more photographs selected by an individual. 
Each story appears as a sequence of thumbnail images. In one embodiment, the 
imported and authored stories are ordered according to their time of creation 
with separate stories being visually distinguished using different colored 
backgrounds. In another embodiment, stories in a track may be separated by 
spaces in the display. The size of the spaces may depend on the length of time 
between the story creation times. For example, in one embodiment, 
photographs taken on the same day are closer together on the second track 110 
than those separated by a day or a weekend. The space may be a function of a 



Ricoh 






time duration, but not necessarily linear. For example, there might be set 
increments for "day", "weekend", "week" and "more than a week". 
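
A stepped, non-linear spacing function of this kind might look like the sketch below. The pixel widths and the exact time limits chosen for "day", "weekend", and "week" are assumptions for illustration; the description names the increments but not their values.

```python
from datetime import datetime, timedelta

# Hypothetical widths, in pixels, for each set increment.
GAP_STEPS = [
    (timedelta(days=1), 4),   # "day": stories created close together
    (timedelta(days=3), 12),  # "weekend"
    (timedelta(days=7), 20),  # "week"
]
LONG_GAP = 28                 # "more than a week"


def gap_width(earlier: datetime, later: datetime) -> int:
    """Map the time between two story creation times to a display gap."""
    elapsed = later - earlier
    for limit, width in GAP_STEPS:
        if elapsed < limit:
            return width
    return LONG_GAP
```

Because the widths are capped, a gap of a year takes no more room on the track than a gap of two weeks, while short gaps remain visibly different from one another.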

The bottom, or third, track 115 represents a story being authored. That 
is, the bottom track 115 includes a working set of thumbnail images that have 
been selected and manipulated by the user. The set of images may only appear 
in the bottom track 115 for the current session with the system. If the user 
wishes to further author a story after a session has ended, the user selects the 
authored story for inclusion into the bottom track 115. 

In one embodiment, a thumbnail image appears in the bottom track 115 
if it has been added to the working set by pressing one or more control buttons 
(e.g., either of the + (add) or record buttons, as detailed below). At any time, 
the story being authored has a pending status. When the story is completed 
and saved by the user, it joins the set of authored stories in the middle track 
110. In one embodiment, the middle track 110 and the bottom track 115 need 
not be visually distinguished, nor for that matter, any of the tracks. 

The display of the three tracks 105, 110, and 115 enables an essentially 
modeless interface where a user can simultaneously view authored stories, 
view/navigate through photographs and view/create new stories. The display
also provides helpful context for viewing the current image. For example,
when the user selects a thumbnail and views an image from one story, adds it
to the current story, then adds annotation, the context of that image with the
surrounding thumbnails in both the original and current story remains visible
and helpful for annotation.

In one embodiment, each thumbnail image appears exactly once in the 
top track 105. It may appear in any number of stories in the middle track 110 
and appears in the bottom track 115 only if it has been explicitly added to or 
annotated as part of the story currently being authored.

In a typical interaction, the user comes across a thumbnail image and 
adds it to the working set (bottom track 115). The user may also record a
related voice narration to accompany the selected thumbnail image. At the end 
of the session, all the thumbnail images in the bottom track 115 are grouped 
into a single story, and the story is then appended or added to the middle track 
110. 
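
The typical interaction above can be sketched as a small model of the three tracks. The class name Tracks and its method names are hypothetical, and the sketch ignores narrations and display concerns; it only shows how the working set moves to the middle track when a session is saved.

```python
from typing import List


class Tracks:
    """Hypothetical model of the three tracks in the first area."""

    def __init__(self, photos: List[str]):
        self.top = list(dict.fromkeys(photos))  # each image appears once
        self.middle: List[List[str]] = []       # authored stories
        self.bottom: List[str] = []             # story being authored

    def add(self, image: str) -> None:
        """Append the selected image to the working set."""
        self.bottom.append(image)

    def remove(self, image: str) -> None:
        """Remove the selected image from the working set, if present."""
        if image in self.bottom:
            self.bottom.remove(image)

    def save(self) -> None:
        """Group the working set into one story on the middle track."""
        if self.bottom:
            self.middle.append(self.bottom)
            self.bottom = []
```

Saving clears the bottom track, so the next session starts with an empty working set while the finished story remains available in the middle track.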

The central area 103 of the screen display shows a large image 120. The 
large image 120 corresponds to a thumbnail image of a story or photograph 
currently being selected in the tracks in the first area 102. This allows the user 
to view photographs at a size that is comfortable and enjoyable (as opposed to 
the smaller thumbnail versions in tracks 105, 110 and 115). In another 
embodiment, other objects may require other display characteristics. 

The third area 104 of the screen display shows the available audio 
narrations of the photograph currently displayed in the central area 103. In 
another embodiment, other information, such as, for example, creation date 
may also be displayed in the third area 104. As shown in Figure 1, there are 




two narrations 130 and 135 available for the photograph 120 in the central area 
103 indicated by the depiction of the wave signal. In one embodiment, each 
narration has a visual display corresponding to the length of time of the 
narration (e.g., the longer the signal wave the longer the narration), together 
with related recording information, such as, for example, time and date 
recorded, and the name of recording user. 

Figure 2 illustrates an exemplary embodiment of the control buttons and 
the display of a system for creating and/or viewing multimedia stories. In one 
embodiment, the control buttons are integrated into and positioned at the sides 
of the body of the system to enable the user to manipulate the controls while
holding the system in two hands. This further removes the need for a 
keyboard, mouse, or other device that forces the user to let go of the system. 

As illustrated in Figure 2, the control buttons in this embodiment include 
navigation buttons such as a scroll left button 205 to scroll the images in a 
selected track to the left, a scroll right button 210 to scroll the images in a
selected track to the right, and a track selection button 215 to select the track to 
scroll. In one embodiment, a jog shuttle is used to control scrolling. 

In one embodiment, a colored (e.g., yellow) vertical line 255 indicates the 
selected track and the selected thumbnail image of this track. The vertical line 
255 is shown in the first thumbnail image of the bottom track. The selected 
thumbnail image corresponds to the large image displayed in the central area 
250. The track selection button 215 moves the vertical line 255 between the
three different tracks. 

When either of the scroll buttons 205 or 210 is pressed, the thumbnail
image immediately to the left or to the right of the previously selected
thumbnail image on the current track is selected and displayed in the central
area 250. As a different thumbnail image moves under the vertical line 255, the
corresponding image is displayed in the central area 250. Note that any type of
indicator or indication may be used on the screen to select an image. It should
also be noted that while in one embodiment, the scroll button 205 or 210 causes
the images in the track with the vertical line 255 to move to the left or right
while leaving the vertical line 255 in place, the system could keep the images in
place while moving the vertical line 255 to select an image. In one embodiment,
the navigation between the images may be controlled by an external joystick
communicating with the system through a joystick interface.

In one embodiment, the user may quickly traverse the thumbnail images 
on a track by using variable-speed scrolling. At standard speed, the display 
appears as shown in Figure 2. In one embodiment, standard speed refers to 
pressing one time on the left scroll button 205 or the right scroll button 210. 
This moves the track by exactly one thumbnail image. Faster scrolling speeds 
may be enabled by rendering low resolution versions of the thumbnail images 
that are quickly accessible from a separate index in the system and by not 
rendering the image in the central area 250 or the narrations in the audio area 
260. In one embodiment, such images may be cached in memory to increase
access speed. To increase scrolling speed, the user may need to hold down one
of the scroll buttons 205 or 210 for a longer period of time. When this occurs,
the images appear to scroll faster past the vertical line indicator 255.
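
One way to picture variable-speed scrolling is as a mapping from how long a scroll button has been held to a rendering mode. The sketch below is illustrative only: the thresholds, the rates, and the names ScrollState and scroll_state are assumptions, not values taken from the description.

```python
from typing import NamedTuple


class ScrollState(NamedTuple):
    thumbnails_per_second: int
    low_resolution: bool   # draw cached low-resolution thumbnails
    render_details: bool   # draw the large image and the narrations


def scroll_state(hold_seconds: float) -> ScrollState:
    """Pick a scrolling mode from how long a scroll button has been held.

    The thresholds and rates are assumptions chosen for illustration.
    """
    if hold_seconds < 0.5:
        return ScrollState(1, False, True)  # standard speed
    if hold_seconds < 2.0:
        return ScrollState(8, True, False)
    return ScrollState(24, True, False)
```

At the faster rates the sketch skips the central image and the narration display, which mirrors the idea of trading detail for speed while the user is still searching.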

In one embodiment, an expand/collapse button 265 controls the view or
the number of photographs displayed for each story on one of the three tracks. 
In the expanded view, every thumbnail image in a story is shown. In one 
embodiment, the expanded view is the default view. In the collapsed view, 
each story is represented by a predetermined number of thumbnail images
(e.g., one thumbnail image). In one embodiment, a first thumbnail image of the
story is used to represent the story. The collapsed view allows for faster 
navigation. Figure 3 illustrates an exemplary expand/collapse view of a story. 
Referring to Figure 3, the two stories 305 and 310, with thumbnail images 
belonging to one story having the same colored background, are shown in
expanded form. The same two stories are illustrated in collapsed form in 
thumbnails 315 and 320, with the first thumbnail image of each story 305 and 
310 being used as the thumbnail image in the corresponding collapsed form. 
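
The expanded and collapsed views can be sketched as a single function over the stories on a track. The function name visible_thumbnails and the per_story parameter are hypothetical; the default of one thumbnail per story follows the example given above.

```python
from typing import List, Sequence


def visible_thumbnails(stories: Sequence[Sequence[str]], collapsed: bool,
                       per_story: int = 1) -> List[str]:
    """List the thumbnails shown on a track in expanded or collapsed view."""
    shown: List[str] = []
    for story in stories:
        # Collapsed view keeps only the leading thumbnail(s) of each story.
        shown.extend(story[:per_story] if collapsed else story)
    return shown
```

With two stories of three and two images, the expanded view lists all five thumbnails and the collapsed view lists only the first image of each story.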

Referring back to Figure 2, the cluster of buttons at the bottom left of the 
system 200 provides the user with additional controls for authoring, browsing 
and playing back of stories. A play button 220 is used to enter a play mode to 
allow an individual to view existing stories. During the play mode, the system 
starts playback from the currently selected thumbnail image. In one
embodiment, although not necessary, each thumbnail image has an audio clip
associated with it as part of a story in which it resides. Furthermore, each 
thumbnail image may be part of more than one story, and thus may have more 
than one audio clip associated with it, one audio clip for each story. However, 
an audio clip may or may not contain any recorded narration. If a narration 
exists, it is played through a built-in loudspeaker in the system (not shown). If 
there is no recorded narration, there will be a pause of a certain length of time, 
for example one second. After the narration is played or after the pause (due to 
not having the narration), the selected track automatically scrolls forward to a 
next thumbnail image in the story. In one embodiment, the next thumbnail 
image is determined based on a default viewing and storytelling direction of 
left-to-right. If the user navigates to a new thumbnail image while the system is
playing, for example by scrolling left or right or by changing to a different track,
playback of the currently playing audio clip is stopped. If the user takes no
further action for a brief length of time (e.g., one second), the system plays the
audio clip associated with the newly selected thumbnail image and continues
with that story from that point on. 
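
The play-mode sequence described above can be sketched as a plan of steps, one per thumbnail. This is a simplified illustration: the function name playback_plan and the tuple layout are hypothetical, and user interruptions during playback are not modeled.

```python
from typing import List, Optional, Sequence, Tuple


def playback_plan(story: Sequence[Tuple[str, Optional[float]]], start: int = 0,
                  pause_seconds: float = 1.0) -> List[Tuple[str, str, float]]:
    """List the play-mode steps from the selected thumbnail to the story's end.

    Each story item is (image, narration length in seconds or None). An image
    with a narration is shown while the narration plays; an image without one
    is shown for a fixed pause before the track scrolls forward.
    """
    plan = []
    for image, clip_seconds in story[start:]:
        if clip_seconds:
            plan.append((image, "play", clip_seconds))
        else:
            plan.append((image, "pause", pause_seconds))
    return plan
```

Starting the plan at the selected index reflects that playback begins from the currently selected thumbnail image and proceeds left to right.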

As discussed above, the selected thumbnail image may have more than
one associated audio clip and corresponding narrations. Whenever a thumbnail 
image is selected, all of the audio narrations associated with that thumbnail 
image are displayed in the audio area 260. In one embodiment, if the selected 
thumbnail image has more than one associated audio clip, and when the system 
is playing an audio clip, pressing the play button 220 causes the system to
advance to a next audio clip. 

Referring to Figure 1, there are two available narrations for the selected 
thumbnail image, narration 130 and narration 135. Each narration is marked
with the time of recording and the name of the recording user. The length of
the wavy lines is proportional to the duration of the audio. The narration
associated with the selected story is listed first, as in narration 130. Referring to
Figure 2, the narration associated with the selected story is played by default 
when the play button 220 is pressed. Pressing the play button 220 multiple 
times in quick succession selects one of the alternate audio clips and playback 
"jumps" to the corresponding story, providing a method of automatic 
hyperlinking between stories. 
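
The jump between stories through repeated presses of the play button can be sketched as a selection over the clips of the current thumbnail. The function name story_after_presses is hypothetical, and wrapping around after the last clip is an assumption that the description does not state.

```python
from typing import Sequence


def story_after_presses(clip_stories: Sequence[str], selected_story: str,
                        presses: int) -> str:
    """Return the story whose clip plays after a number of quick presses.

    `clip_stories` names the story behind each audio clip of the selected
    thumbnail. The clip of the selected story is listed first, so one press
    plays it; further presses step through the alternates. Wrapping around
    after the last clip is an assumption.
    """
    # A stable sort moves the selected story's clip to the front and keeps
    # the remaining clips in their original order.
    ordered = sorted(clip_stories, key=lambda story: story != selected_story)
    return ordered[(presses - 1) % len(ordered)]
```

Playback then continues in the returned story, which is what makes a shared photograph act as a link between the stories that contain it.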

The system remains in the play mode until the stop button 225 is pressed 
or until the end of story is reached. When this occurs, the system enters into 
stop mode. While in the stop mode, no audio clip is played by the system. 

In one embodiment, while the system is in the play mode, the remove (-) 
button 235 has no effect on the play mode of the system. However, pressing the 
add (+) button 230 adds the currently displayed image to the working set. 
Thus, the system appends the currently displayed image to the working set, but 
it does not stop the play mode. Furthermore, if the user presses the save button 
245 while the system is in the play mode, the system stops playing the audio 
clip and performs a save operation. Furthermore, if the user presses the record
button 240 while the system is in the play mode, the audio clip is stopped. 

In one embodiment, while in the authoring mode (not the play mode) a 
story may be created by pressing the + (add) button 230 when the vertical line 
255 is positioned over a particular thumbnail image to append a copy of that
currently selected thumbnail image onto the working set or the bottom track
115. The - (remove) button 235, conversely, removes the selected thumbnail
image (i.e., the image over which the vertical line 255 is positioned) from the
working set on the bottom track 115.

While in authoring mode, pressing the record button 240 starts the
recording function. While the recording function is active, the audio recorded
by the microphone is stored and associated with the currently displayed
thumbnail image in the working track. In one embodiment, this is done by
making a link in the underlying XML file. If the thumbnail image is not already
in the working set, it is appended before the recording begins, as though the +
button 230 was pushed first. If the selected thumbnail image is already on the
working track, the new recording overwrites any previous recording associated
with the thumbnail image in the story being authored. While recording, if the
user selects a new thumbnail image, such as, for example, by scrolling left or
right, by changing to a different track, or by pressing the expand/collapse
button, the recording continues for the audio clip associated with the new
thumbnail image. In addition, the new thumbnail image is appended to the
working track if it is viewed for more than a short time (e.g., 1 second).
However, if the user scrolls left or backward, but still remains on the working
track, recording is automatically stopped. This prevents accidental erasures. In
other embodiments, the new thumbnail image is inserted at a point before or
after the thumbnail image last selected on the working track (e.g., the center
image of track 3). In this situation, recording continues starting with the audio
clip associated with the new inserted thumbnail image. Furthermore, pressing
the play button 220 also stops the recording mode and puts the system in the
play mode. This makes recording a story as similar to viewing a story as
possible.
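
The recording rules above can be summarized in a small state model. The sketch is a simplification under stated assumptions: the class RecordingSession and its method names are hypothetical, clips are represented by labels rather than audio data, and a thumbnail viewed too briefly is simply skipped.

```python
from typing import Dict, List


class RecordingSession:
    """Hypothetical model of authoring-mode recording on the working track."""

    def __init__(self, min_view_seconds: float = 1.0):
        self.working: List[str] = []     # thumbnails on the working track
        self.clips: Dict[str, str] = {}  # thumbnail -> current audio clip
        self.recording = False
        self.min_view_seconds = min_view_seconds

    def press_record(self, image: str, clip: str) -> None:
        if image not in self.working:
            self.working.append(image)   # as though + were pressed first
        self.clips[image] = clip         # overwrites an earlier recording
        self.recording = True

    def select(self, image: str, clip: str, viewed_seconds: float,
               backward_on_working_track: bool = False) -> None:
        if not self.recording:
            return
        if backward_on_working_track:
            self.recording = False       # prevents accidental erasures
            return
        if viewed_seconds > self.min_view_seconds:
            if image not in self.working:
                self.working.append(image)
            self.clips[image] = clip     # recording continues on the new image

    def press_play(self) -> None:
        self.recording = False           # play ends the recording mode
```

Stopping the recording when the user moves backward on the working track is what protects an existing clip from being overwritten by navigation alone.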

In another embodiment, the system may include a touchscreen or a 
pointing device (e.g., cursor control device). In this case, while in the recording 
mode, the user's pointing gestures would be captured. During playback, areas 
that had been pointed to are highlighted at appropriate times. One way to 
perform highlighting is to darken the rest of the image. Alternatively, a
semi-transparent overlay can be used (e.g., a yellow circle).

When the system is in the stop mode, the user may drag thumbnail 
images from one track to another, or the user may drag thumbnail images to 
different points within a story. Alternatively, a dragging operation can be used 
in place of scroll left/right to move all or multiple images in a track at the same 
time. 
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Pointing to a particular thumbnail selects the thumbnail image, selects 
the track the thumbnail image is on, and displays the corresponding large 
image in the central area 250. In another embodiment, touching the screen may 
highlight an image in a track but not select it. 

In one embodiment, the recording operation supports both "select then
narrate" and "select while narrating" strategies. In the "select then narrate"
strategy, the user may compose a story by first selecting a working set of
thumbnail images using the + button 230 or the - button 235 and then
annotating each thumbnail image in order. Alternatively, in the "select while
narrating" strategy, the user may continuously record narrations while
navigating and selecting photos. When recording is active, each new
photograph that a user views for longer than a short time interval is
automatically added to the working set along with any recorded input. This
supports the "select while narrating" strategy. For sound recording, in one
embodiment, a microphone 270 is attached to or integrated into the external
body of the system. In another embodiment, the device is equipped with
speech recognition software to translate audio narration into text.
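
The rule that supports "select while narrating" can be sketched as below. The text says only that a photograph must be viewed for longer than "a short time interval", so the two-second threshold, the class name `WorkingSet`, and the method name are assumptions made for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class WorkingSet:
    """Photos added to the working track, each with any narration recorded for it."""
    min_view_seconds: float = 2.0   # hypothetical threshold; the text gives no value
    entries: list = field(default_factory=list)

    def on_photo_viewed(self, photo_id, viewed_seconds, audio=None, recording=True):
        """Add the photo and its recorded input if it was viewed long enough while recording."""
        if recording and viewed_seconds > self.min_view_seconds:
            self.entries.append((photo_id, audio))
            return True
        return False


working = WorkingSet()
working.on_photo_viewed("beach.jpg", 0.5)                      # skimmed past: ignored
working.on_photo_viewed("sunset.jpg", 6.0, audio="clip_a.au")  # dwelt on: added with narration
working.on_photo_viewed("dinner.jpg", 4.0, recording=False)    # not recording: ignored
```

Only the photograph the user lingered on while recording ends up in the working set, so navigating past other photographs does not clutter the story.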

The group of buttons at the bottom right of Figure 2 controls story 
operations. The save button 245 "saves" the current working story that is 
displayed on the bottom track by moving it to the end of the middle track.
Note that the current state of the system is also saved to the system storage
device at that time. The XML files (or equivalent representations) along with
audio clips may be stored in memory (e.g., RAM) and written out to a hard disk
when saved. (They may also be written out at other times.) In another
embodiment, the user also has the option of electronically sending a
completed story to another user for viewing on a similar system or on a regular
PC via a media player application or standard Web browser. In another
embodiment, a "print" button allows the user to print a selected image on a
printer connected locally by wire or wireless technology, or on a remote device.
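
The effect of the save button on the tracks can be sketched as a simple list operation. The dictionary layout, the key names, and the assumption that the working track is emptied by the move are illustrative only; writing the state out to the storage device is omitted.

```python
def save_working_story(tracks):
    """Move the story on the working (bottom) track to the end of the authored (middle) track.

    tracks: dict with "imported", "authored" and "working" keys (hypothetical layout);
            "authored" is a list of stories and "working" is a list of photo ids
    """
    working = tracks["working"]
    if working:                                  # an empty working track has nothing to save
        tracks["authored"].append(list(working))
        tracks["working"] = []                   # assumption: the move clears the working track
    return tracks


tracks = {
    "imported": [["p1", "p2", "p3"]],
    "authored": [["p2"]],
    "working": ["p3", "p1"],
}
save_working_story(tracks)
```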

In one embodiment, the system has an attached image
creation/capturing device (e.g., a video camera). The video camera can point
inward at the holder of the system or it can point outward. A take-picture
button on the system allows the user to take a still image from the camera,
time-stamp it, and add the image to the top track. In another embodiment, an
image of the narrator is automatically grabbed by the camera at the beginning
of every recording session, or at pre-set intervals, or every time the video
input changes by more than some predetermined threshold. Images captured by
the video camera may be added to the first track 105 and are then available
for inclusion in authored stories like all other objects in the system.
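
The threshold-based capture can be sketched as follows. The text does not say how the change in the video input is measured, so the mean absolute pixel difference, the comparison against the last captured frame, and the function names are all assumptions.

```python
def mean_abs_difference(frame_a, frame_b):
    """Average absolute per-pixel difference between two equal-sized grayscale frames."""
    total = sum(abs(a - b) for a, b in zip(frame_a, frame_b))
    return total / len(frame_a)


def select_captures(frames, threshold):
    """Return the indices of the frames to capture.

    The first frame is always captured (start of a recording session). A later frame
    is captured when it differs from the last captured frame by more than `threshold`.
    """
    if not frames:
        return []
    captured = [0]
    for i in range(1, len(frames)):
        if mean_abs_difference(frames[captured[-1]], frames[i]) > threshold:
            captured.append(i)
    return captured


frames = [
    [10, 10, 10, 10],
    [12, 11, 10, 9],     # small change: not captured
    [90, 95, 100, 90],   # large change: captured
    [91, 94, 100, 90],   # small change relative to the previous capture
]
capture_indices = select_captures(frames, threshold=20)
```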

In one embodiment, stories and metadata about photographs are stored
on the storage device in Extensible Markup Language (XML). Figure 4
illustrates an exemplary XML metadata file that the system reads during
initialization. The initializing metadata file 400 is stored in a predetermined
known location on the storage device. In one embodiment, the initializing
metadata file is named "families.xml" and the location of the file is determined
using a local Uniform Resource Locator (URL) corresponding to the file on the
storage device of the system. For each possible group (e.g., family) using the
system, the initializing metadata file contains pointers to the stories
corresponding to the three tracks. For example, the metadata file of Figure 4
points to a story file 405 "meta6.xml". In one embodiment, the initializing
metadata file also supports further administrative functions, such as, for
example, user ids and passwords to prevent unauthorized viewing of personal
stories. In one embodiment, the initialization file contains user-specific
information. In another embodiment, the system assumes a single user.

Figure 5A illustrates an exemplary XML metadata file for a story. The XML
metadata file for story 500 points to two other stories, "metaS.xml" 505 and
"meta5.xml" 510. Note that each metadata file has an associated "type"
attribute whose value is either "story" or "document", where "document"
indicates a single object.

Figure 5B illustrates another exemplary XML metadata file for a story.
Metadata file 520 represents the story "metaS.xml" 505 pointed to by the
metadata file 500. Metadata file 520 has a list of photographs shown as
"meta1.xml" 525 and "meta2.xml" 530. Note that the photograph references are
in the form of the URL of the corresponding metadata XML file. Each
photograph may also have an associated audio clip, shown as "clip1.au" 535
and "clip2.au" 540. In one embodiment, the offset attribute of the audio tag
associated with audio clip 535 or audio clip 540 specifies the starting location
of the referenced audio narration within the audio file, measured in
milliseconds. This is useful when, for example, one audio file contains more
than one user-supplied audio narration.
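
Reading such a story file can be sketched as below. The exact markup of Figure 5B is not reproduced in the text, so the element names (`meta`, `item`, `audio`) and the attribute names (`href`, `src`) are hypothetical; only the "type" and offset attributes and the file names come from the description.

```python
import xml.etree.ElementTree as ET

# Hypothetical story file; element and attribute names are assumptions.
STORY_XML = """
<meta type="story">
  <item href="meta1.xml">
    <audio src="clip1.au" offset="0"/>
  </item>
  <item href="meta2.xml">
    <audio src="clip2.au" offset="1500"/>
  </item>
</meta>
"""


def read_story(xml_text):
    """Return a list of (photo_metadata_url, audio_file, offset_ms) tuples in story order."""
    root = ET.fromstring(xml_text.strip())
    if root.get("type") != "story":
        raise ValueError("not a story metadata file")
    entries = []
    for item in root.findall("item"):
        audio = item.find("audio")
        if audio is None:
            entries.append((item.get("href"), None, 0))   # photograph without narration
        else:
            offset_ms = int(audio.get("offset", "0"))     # start of the narration, in milliseconds
            entries.append((item.get("href"), audio.get("src"), offset_ms))
    return entries


story_entries = read_story(STORY_XML)
```

A player would then seek to `offset_ms` within the audio file before playing the narration for each photograph.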

Figure 6 illustrates an exemplary XML metadata file for a single
photograph. Metafile 600 represents the image "meta1.xml" pointed to by the
metafile illustrated in Figure 5B. Metafile 600 includes URLs that point to two
image files, a full size image 605 that may be used in area 103 of the display and
a thumbnail image 610 that may be used in the tracks 105, 110, and 115. In
addition, the metafile 600 includes color data for a reduced representation. The
reduced representation of the image may be rendered either as a thumbnail
image or as a full size image.

In one embodiment, using XML allows for easy translation to other 
formats such as Hypertext Markup Language (HTML) or Synchronized 
Multimedia Integration Language (SMIL). This enables stories to be shared 
with others and viewed on different devices. The SMIL format is especially 
appropriate as it allows the synchronization of audio with a series of images to 
match the structure of the stories. Using any of these formats, a story including 
the associated photos and audio clips can be saved in a file. The file may then 
be uploaded to a web server and assigned a unique URL. The URL may then be
sent to a specified recipient or list of recipients by email using the
standard email protocols, for example SMTP. In one embodiment, to send the
electronic mail (email), the system provides a network interface, for example
Ethernet, modem, or wireless. Alternatively, the file may be sent to the
recipient(s) as an email attachment using a standard attachment encoding
technique, for example Multipurpose Internet Mail Extensions (MIME).
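
The translation of a story to SMIL can be sketched as follows. It assumes the story has already been reduced to (image, audio) pairs; the function name and the five-second default duration for photographs without narration are hypothetical. The `seq`, `par`, `img`, and `audio` elements and the `dur` attribute are standard SMIL.

```python
import xml.etree.ElementTree as ET


def story_to_smil(entries):
    """Build a SMIL document that shows each image while its audio clip plays.

    entries: list of (image_url, audio_url) pairs in story order; audio_url may be None
    """
    smil = ET.Element("smil")
    body = ET.SubElement(smil, "body")
    seq = ET.SubElement(body, "seq")           # photographs are presented one after another
    for image_url, audio_url in entries:
        par = ET.SubElement(seq, "par")        # image and narration are presented together
        img = ET.SubElement(par, "img", src=image_url)
        if audio_url is not None:
            ET.SubElement(par, "audio", src=audio_url)
        else:
            img.set("dur", "5s")               # hypothetical display time when there is no narration
    return ET.tostring(smil, encoding="unicode")


smil_text = story_to_smil([("photo1.jpg", "clip1.au"), ("photo2.jpg", None)])
```

The nesting mirrors the structure of a story: a sequence of photographs, each paired with its own narration.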

In one embodiment, the email address of the recipients may be specified
using the scroll left/right and the add (+)/remove (-) buttons. These buttons
are used to navigate among letters and punctuation symbols displayed on the
top track 105. An email address may be constructed on the bottom track 115.
The middle track may be used to display previously saved email addresses.
Email addresses of people who send stories to the users may automatically be
added to the second track. Furthermore, the system may accept input email
addresses through a "contact" file or vCard file (a Personal Data Interchange
(PDI) technology developed by the Versit Consortium) beamed by IR from a
PDA (e.g., a Palm Pilot). To generate a new email address, the user may use a
stylus to type on an on-screen keyboard. Using the stylus, handwriting
recognition software, for example Graffiti by Palm Computing, may be used to
recognize the user's handwriting. Alternatively, the user may use an external
keyboard plugged into a keyboard interface provided by the system. Besides
sending stories to email recipients, the system may also automatically check for
new stories, for example, in the form of email messages or a predetermined web
page listing new stories sent to the device/owner, at set intervals or at set times
of day. The system may also automatically download any new stories or
messages.

In one embodiment, the system includes a cradle or docking station that
includes power and a network interface. In one embodiment, all communication
(sending/receiving messages) is queued for execution when the system is
docked. The communication then occurs automatically when docked. In one
embodiment, while the system is in the cradle or otherwise unused for a set
period of time, an auto play screen saver feature is activated, causing the screen
to cycle through all the stories in the system.
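
The queued communication can be sketched as below. The class name `CommunicationQueue`, its methods, and the `send` callable are hypothetical; the sketch shows only the deferral of outgoing messages until the device is docked.

```python
from collections import deque


class CommunicationQueue:
    """Holds outgoing messages until the device is docked (illustrative sketch)."""

    def __init__(self, send):
        self._send = send          # callable that transmits one message
        self._pending = deque()
        self.docked = False

    def submit(self, message):
        """Queue a message; it is sent straight away only if the device is docked."""
        self._pending.append(message)
        if self.docked:
            self.flush()

    def dock(self):
        """Called when the device is placed in the cradle; sends everything queued."""
        self.docked = True
        self.flush()

    def undock(self):
        self.docked = False

    def flush(self):
        """Send queued messages in the order they were submitted."""
        while self._pending:
            self._send(self._pending.popleft())


sent = []
queue = CommunicationQueue(send=sent.append)
queue.submit("story for alice")
queue.submit("story for bob")
sent_before_docking = list(sent)
queue.dock()
```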

In one embodiment, the system may create a movie file using the images 
and the associated audio clips in synchronization with each other, just as they 
are shown on the system when the user presses the play button. The movie file 
may then be stored in a standard format, such as, for example MPEG, AVI, or 
QuickTime, and may then be sent to the recipient(s) as an email attachment. In 
another embodiment, the author of a story may choose to send his or her own 
photograph to be associated with a story when the story is sent to a recipient. 
A link to this photograph is included and associated with the email. When the 
recipient views the email, the photograph of the author is displayed in an 
image slot. Alternatively, the photograph associated with the author might also 
be displayed as the first photograph in a story, or somehow combined with the 
first photograph especially for the collapsed view. 
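
Packaging a movie file as an email attachment can be sketched with Python's standard `email` package, which applies MIME encoding. The addresses, subject line, and file name are placeholders, and the message is only built here, not sent.

```python
from email.message import EmailMessage


def build_story_email(sender, recipients, movie_bytes, filename="story.mpg"):
    """Build a MIME message that carries the movie file as an attachment."""
    msg = EmailMessage()
    msg["From"] = sender
    msg["To"] = ", ".join(recipients)
    msg["Subject"] = "A story for you"          # placeholder subject
    msg.set_content("A narrated photo story is attached.")
    # Attach the movie as video/mpeg; the message becomes multipart/mixed.
    msg.add_attachment(movie_bytes, maintype="video", subtype="mpeg", filename=filename)
    return msg


message = build_story_email("author@example.com", ["friend@example.com"], b"\x00\x01\x02")
```

The resulting message could then be handed to an SMTP client such as `smtplib.SMTP.send_message`.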

As discussed above, the system may accept as input media objects from
various sources. In one embodiment, other media objects, for example video
clips, presentation slides, pages of documents, web pages, and audio clips may
also be loaded onto the system. With video clips as input, an incoming
video stream can be captured through a standard analog video capture board,
or via a "firewire" (IEEE 1394) interface, or by direct connection to an
already-digitized video file, for example an MPEG file or a QuickTime file. The
stream can then be segmented into clips using standard scene-change detection
algorithms. Each clip can then be treated as a photograph. Furthermore, each
clip may already have an associated audio segment from the original video
source. In this embodiment, pressing the play button enables the system to
play both the audio segment and the video clip synchronized with each other.
The system then moves on to a next clip of the story. A first frame of the video
clip may be used as a thumbnail image representing the video clip.
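
The text does not name a particular scene-change detection algorithm. One common approach, comparing intensity histograms of consecutive frames, can be sketched as follows; the bin count, the threshold, and the function names are assumptions.

```python
def histogram(frame, bins=4, max_value=256):
    """Coarse intensity histogram of a grayscale frame given as a flat list of pixels."""
    counts = [0] * bins
    for value in frame:
        counts[value * bins // max_value] += 1
    return counts


def segment_into_clips(frames, threshold):
    """Split a frame sequence into clips wherever consecutive histograms differ sharply.

    Returns a list of (start_index, end_index_exclusive) pairs.
    """
    if not frames:
        return []
    clips = []
    start = 0
    previous = histogram(frames[0])
    for i in range(1, len(frames)):
        current = histogram(frames[i])
        difference = sum(abs(a - b) for a, b in zip(previous, current))
        if difference > threshold:
            clips.append((start, i))   # scene change: close the current clip
            start = i
        previous = current
    clips.append((start, len(frames)))
    return clips


dark = [10, 20, 30, 40]
bright = [200, 210, 220, 230]
clips = segment_into_clips([dark, dark, bright, bright, dark], threshold=4)
```

The frame at each `start_index` is the first frame of its clip and could serve as the clip's thumbnail image.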

With presentation slides as input, the system creates an image for each
presentation slide. The presentation slides may have been generated by
presentation software (e.g., PowerPoint) or other document or graphics editing
tools. The story is then viewed on the system as a sequence of slides, similar to
a presentation. With the pages of a document as input, a story in the top track
corresponds to the pages of an original document, in order. A story in the
middle track is a collection of reorganized pages with added audio
commentary, for example a summary. Documents can easily be imported from
existing formats, both page-based, for example PostScript or PDF, and linear,
for example HTML or plain text. In the latter case a pagination operation
would be required.

With Web pages as input, the top track 105 may contain a user's
chronological browsing history. This may be captured by monitoring a Web
browser or by acting as a proxy server intercepting and serving all of a user's
HTTP requests. Each story corresponds to a length of time (e.g., one day or one
browsing session, etc.). The second track allows the user to compose and
sequence groups of Web pages for easy access and later retrieval. In the second
track, a story might correspond to a bookmark folder, a series of pages found in
the course of one search, or any other structure as is useful to the user.
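
Grouping a browsing history into one story per day can be sketched as below. The representation of a visit as a (timestamp, URL) pair and the function name are assumptions; capturing the history itself (by browser monitoring or a proxy) is outside the sketch.

```python
from datetime import datetime
from itertools import groupby


def group_history_by_day(visits):
    """Group (timestamp, url) visits into one story per calendar day, in chronological order."""
    ordered = sorted(visits, key=lambda visit: visit[0])
    stories = []
    for day, day_visits in groupby(ordered, key=lambda visit: visit[0].date()):
        stories.append((day, [url for _, url in day_visits]))
    return stories


visits = [
    (datetime(2000, 5, 2, 9, 15), "http://example.com/news"),
    (datetime(2000, 5, 1, 20, 30), "http://example.com/recipes"),
    (datetime(2000, 5, 1, 20, 45), "http://example.com/recipes/soup"),
]
history_stories = group_history_by_day(visits)
```

Grouping by browsing session instead of by day would only change the key function, for example by starting a new story after a long gap between visits.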

With audio clips as input, the top track may represent "albums" or
audio CDs that the user has scanned into the system via a CD-ROM drive, or
downloaded from the Web directly as a series of audio files, for example MP3
files from MP3.com, Windows Media Audio files from Microsoft, Liquid Audio
files from LiquidAudio.com, RealJukebox files from Real Networks, etc. Each
"album" is an ordered set of "songs". The second track represents the user's
"playlists" (e.g., the user's own sequences of songs to be played). In this case,
there is a default visual representation for each song, or the user chooses
relevant icons, or a Web service provides icons to match artist names, song
titles or album titles.

Figure 7 illustrates an embodiment of an exemplary computer system
that comprises the storage, display and interaction components of the digital
story creation and playback system. The various components shown in Figure
7 are provided by way of example. Certain components of the computer in
Figure 7 can be deleted from the system for a particular implementation of the
invention. In other systems, additional components may be added without
affecting the scope of the present invention.
Figure 7 illustrates a system bus 700 to which various components are
coupled. A processor 701 performs processing tasks required by the computer.
Processor 701 may be any type of processing device capable of implementing
the steps necessary to perform the storage, displaying, and interaction
capabilities described herein. An input/output (I/O) device 704 is coupled to
bus 700 for communicating with other devices coupled to the computer. A
read-only memory (ROM) 703 and a random access memory (RAM) 705 are
coupled to bus 700 to provide storage for various data and information used by
the computer. Although ROM 703 and RAM 705 are shown coupled to bus 700,
in alternate embodiments, ROM 703 and RAM 705 are coupled directly to
processor 701 or coupled to a dedicated memory bus (not shown).

A video display 710 is coupled to bus 700 and displays various
information and data to the user of the computer. A disk drive 702 is coupled
to bus 700 to provide long-term mass storage of information. In one
embodiment, a pointing device 708 is coupled to bus 700 for entering
information and commands to the computer system. In another embodiment,
two displays may be used, one low-resolution display for the three tracks and a
high-resolution display for the main viewing area. The main viewing area may
not be needed for audio and other media.

From the above description and drawings, it will be understood by those 
of ordinary skill in the art that the particular embodiments shown and 
described are for purposes of illustration only and are not intended to limit the 
scope of the invention. Those of ordinary skill in the art will recognize that the 
invention may be embodied in other specific forms without departing from its 
spirit or essential characteristics. References to details of particular 
embodiments are not intended to limit the scope of the claims.

CLAIMS



What is claimed is: 



1. An apparatus comprising:
    means for displaying a first list of reduced visual representations of a
        plurality of media objects ordered automatically and grouped by
        media objects relating to one another;
    means for displaying a second list of reduced visual representations of the
        plurality of media objects ordered manually;
    means for navigating among the reduced visual representations; and
    means for displaying the plurality of media objects.

2. The system of claim 1 wherein the first list comprises imported stories,
    and the second list comprises authored stories.

3. The system of claim 1 further comprising means for recording a narration
    for an audio clip associated with each media object.

4. The system of claim 1 wherein the display means displays imported stories,
    the authored stories, and/or a representation for each associated audio clip
    for a selected object in the imported stories or the authored stories.

5. The system of claim 4 wherein the imported stories are displayed on a first
    track, and the authored stories are displayed on a second track and on a
    third track, the authored story on the third track being a story being
    created.

6. The system of claim 1 further comprising means for moving the authored
    story from the second track to the third track such that a user can edit the
    authored story.

7. The system of claim 1 wherein means for displaying a first list of reduced
    visual representations of a plurality of media objects comprises means for
    displaying a series of audio files.

8. The system defined in Claim 7 wherein the audio files are from a group
    comprising MP3 files, Liquid Audio files, and RealJukebox files, WAV files,
    or other compressed or uncompressed audio file formats.

9. The system of claim 1 wherein means for displaying a first list of reduced
    visual representations of a plurality of media objects comprises means for
    displaying a plurality of video clips.

10. The system defined in Claim 9 wherein the video clips are from a group
    comprising MPEG files, QuickTime files, AVI files, and RealVideo files.



11. A method comprising:
    displaying a first list of reduced visual representations of a plurality of
        media objects ordered automatically and grouped by media objects
        relating to one another;
    displaying a second list of reduced visual representations of the plurality of
        media objects ordered manually;
    navigating among the reduced visual representations; and
    displaying the plurality of media objects.

12. The method of claim 11 wherein the first list comprises imported stories,
    and the second list comprises authored stories.

13. The method of claim 11 further comprising recording a narration for an
    audio clip associated with each media object.

14. The method of claim 1 wherein displaying the plurality of media objects
    comprises displaying imported stories, the authored stories, and/or a
    representation for each associated audio clip for a selected object in the
    imported stories or the authored stories.

15. The method of claim 14 wherein the imported stories are displayed on a
    first track, and the authored stories are displayed on a second track and on
    a third track, the authored story on the third track being a story under
    construction.

16. The method of claim 11 further comprising moving the authored story
    from the second track to the third track such that a user can edit the
    authored story.

17. The method of claim 11 wherein displaying a first list of reduced
    visual representations of a plurality of media objects comprises displaying
    a series of audio files.

18. The method defined in Claim 17 wherein the audio files are from a group
    comprising MP3 files, Liquid Audio files, and RealJukebox files, WAV files,
    or other compressed or uncompressed audio file formats.

19. The method of claim 11 wherein displaying a first list of reduced visual
    representations of a plurality of media objects comprises displaying a
    plurality of video clips.

20. The method defined in Claim 19 wherein the video clips are from a group
    comprising MPEG files, QuickTime files, AVI files, and RealVideo files.

21. A system comprising:
    a storage device having a plurality of stories, each story comprising a
        plurality of objects; and
    a processor in communication with the storage device, the processor:
        display a first story track for a plurality of imported stories which
            have objects automatically ordered,
        display a second story track for a plurality of authored stories,
        display a full size image of a selected object in the story,
        process navigation input from a user, the navigation input comprising
            moving a track selection from one story track to another story
            track, moving an object selection from one object to another
            object, and
        process operational input from the user, the operation input
            comprising playing the story, stopping recording or the playing,
            and saving the story.

22. The system of claim 21 wherein process operational input from the user
    further comprises recording a narration for the selected object.

23. The system of claim 21 wherein the processor is further operable to display
    a third story track for a story being constructed, and wherein the
    navigation input further comprises adding an object to the third track,
    and removing an object from the third track.

24. The system of claim 21 wherein the processor is further operable to display
    a plurality of narrations associated with the selected object.

25. The system of claim 21 wherein the objects in the imported stories are
    automatically ordered by time.

26. The system of claim 21 further comprising an object input device to input
    new objects, the new objects comprising imported stories, digital
    photographs, video clips, pages of documents, presentation slides, audio
    clips, and web pages.

27. The system of claim 21 further comprising a docking cradle for
    communication and an output device to send a story to a recipient's email
    address in the form of an email attachment.

28. The system of claim 27 further comprising sending the story to a web
    server, assigning a unique URL to the story, and sending the URL to the
    recipient by email.

29. The system of claim 21 further comprising a recording device to record a
    narration for the audio clip, the recording device being one of a group
    comprising voice activated recording and microphone recording.

30. The system of claim 21 wherein
    the selected object is in one or more stories,
    the selected object has one or more associated audio clips,
    each audio clip is associated with one story, and
    each audio clip has zero or one narration.

31. The system of claim 21 wherein displaying an audio clip comprises
    displaying all audio clips associated with the selected object, and wherein
    the audio clip associated with the story is displayed as a current audio clip,
    the current audio clip is played before all other audio clips.

32. The system of claim 21 wherein the story under construction in the third
    story track is placed at the end of the second story track when the
    construction is completed and wherein the story is saved in the storage
    device.

33. The system of claim 32 further comprising grouping objects in the third
    story track and recording a narration for each object, and wherein saving
    the story comprises saving the objects and the associated audio clips.

34. The system of claim 32 wherein the story and the associated objects are
    saved as files in the storage device using a markup language format.

35. The system of claim 34 wherein the markup language format comprises
    HTML, SMIL, or XML.

36. The system of claim 21 wherein playing the story comprises
    selecting a story from the first track, from the second track, or from the
        third track using the navigation input,
    activating a play operation, and
    viewing the full size image corresponding to each of the plurality of objects
        in the selected story.

37. The system of claim 36 wherein viewing the image comprises
    selecting the objects in the story in a sequence, wherein the sequence is
        from beginning of the story to end of the story, and
    playing the audio clip corresponding to the selected object.

38. A method comprising:
    selecting a plurality of objects, the selected objects coming from a plurality
        of stories displayed on a first track having one or more image objects
        displayed thereon or a second track containing one or more
        previously generated stories of one or more image objects stored as a
        story, the selected objects placed in a third track to form a story;
    recording a narration for each selected object and associating the narration
        with the selected object;
    saving the authored story in the second track; and
    playing the story.

39. The method of claim 38 further comprising
    displaying the import track, the author track, and the working track at all
        times,
    displaying a full size image of the selected object,
    displaying one or more audio clips associated with the selected object.

40. The method of claim 38 wherein the audio clip for the selected object
    associated with the story is displayed as a current audio clip, the current
    audio clip is played before all other audio clips for the selected object.

41. The method of claim 40 wherein playing the audio clip further comprises
    selecting an alternative audio clip other than the current audio clip.

42. The method of claim 38 wherein the selected object has one or more
    associated audio clips, wherein each audio clip is associated with one
    authored story, and wherein each audio clip has zero or one narration.

43. The method of claim 38 wherein each story in the import track or in the
    author track has a different colored background from its adjacent stories.

44. The method of claim 38 wherein playing the story comprises
    selecting a story from the import track or from the author track, and
    viewing the full size image corresponding to each object in the selected
        story.

45. The method of claim 44 wherein viewing the full size image comprises
    selecting the objects in the story in a sequence, wherein the sequence is
        from beginning of the story to end of the story, and
    playing the audio clips associated with the selected objects.

46. The method of claim 38 wherein the authored story and its associated
    objects are saved as files using a markup language format.

47. The method of claim 46 wherein the format of the markup language
    comprises HTML, SMIL, or XML.

48. The method of claim 38 wherein the objects in the import track, the author
    track and the working track are displayed as thumbnail images or reduced
    representation of the corresponding objects.

49. The method of claim 38 wherein selecting an object comprises displaying
    the thumbnail images in high resolution if the selection is done in standard
    speed and displaying the thumbnail images in low resolution if the
    selection is done in high speed.

50. The method of claim 49 further comprising not displaying the image and
    the associated audio clips when the selection is done in high speed.

51. The method of claim 38 further comprising receiving new objects from
    external devices, the new objects comprising imported stories, digital
    photographs, video clips, pages of documents, presentation slides, audio
    clips, and web pages.

52. The method of claim 38 further comprising sending a story to a recipient's
    email address in the form of an email attachment.

53. The system of claim 38 wherein each imported story is an automatically
    constructed group, each imported story is visually distinguishable from
    another, and each authored story is visually distinguishable from another.

54. The system of claim 52 wherein each story on the same track has a different
    colored background, and wherein each story is separated from the other by
    a gap.

55. The system of claim 54 wherein the gap represents a time difference
    between story creation times.

56. The system of claim 21 further comprising a pointing device configured to
    enable the user to perform track selection, object selection, and moving of
    the selected object from one track to another track, the pointing device
    being one from a group comprising a mouse, an external joystick, a voice
    activated control device, a touch screen, a track pad, and a cursor control
    device.

57. The system of claim 21 further comprising an attached video camera, the
    video camera used to add new objects to the first track.

58. The system of claim 21 wherein the objects in the first track, the second
    track and the third track are displayed as thumbnail images or in reduced
    representation of the corresponding objects.

59. The system of claim 42 wherein a first thumbnail image for each story is
    used to represent the corresponding story when the tracks are configured
    to display in collapse form.

60. The system of claim 21 wherein moving the object selection from one object
    to another object comprises displaying the thumbnail images in high
    resolution if the movement is in standard speed and displaying the
    thumbnail images in low resolution if the movement is in high speed.

61. The system of claim 60 further comprising not displaying the thumbnail
    image and the associated audio clips when the movement is in high speed.

62. A system comprising:
    means for storing digital stories, the digital stories comprising imported
        stories and authored stories, each story comprising a plurality of
        objects;
    means for authoring stories using stored objects;
    means for displaying story tracks associated with the imported stories and
        the authored stories; and
    means for navigating among the story tracks and among the objects on a
        story track.

63. The system of claim 62 wherein means for displaying story tracks
    comprises means for displaying imported stories in a first track, means for
    displaying authored stories in a second track, and means for displaying a
    story being constructed in a third track.

64. The system of claim 62 wherein means for authoring stories comprises
    means for selecting an object, means for recording a narration for an audio
    clip associated with the selected object, and means for saving the authored
    stories in the storing means.

65. The system of claim 64 wherein the authored stories are saved using XML,
    SMIL, or HTML file format.

66. The system of claim 62 further comprising means for displaying a full size
    image of the selected object, means for displaying one or more audio clips
    associated with the selected object, and means for playing the audio clip.

ABSTRACT OF THE DISCLOSURE

A method and system for digital story authoring, viewing and storing is 
described. A display showing an imported story track, an authored story track, 
and a story-in-progress track. Each story contains one or more objects. The 
5 display also shows a large photo corresponding to a selected object in a story. 
Furthermore, the display also provides visual forms showing audio clips 
associated with the large photo displayed. The photo may have more than one 
associated audio clips. Control buttons are available to navigate among the 
three tracks and among the objects on each track. Other control buttons allow a 
10 user to record audio clips and to author new stories. 
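
The arrangement summarized above (three story tracks, objects with associated audio clips, navigation among tracks and objects, and saving a story as XML) can be sketched in a few lines of Python. This is only an illustrative sketch under stated assumptions: the class names `StoryObject`, `Story`, and `StoryBrowser`, the track names, the clamping behavior at the ends of a track, and the XML element and attribute names are all hypothetical and are not taken from the disclosure.

```python
from dataclasses import dataclass, field
from typing import List, Optional
import xml.etree.ElementTree as ET


@dataclass
class StoryObject:
    """One object in a story: a photo plus zero or more audio clips."""
    image: str
    audio_clips: List[str] = field(default_factory=list)


@dataclass
class Story:
    """A titled sequence of objects."""
    title: str
    objects: List[StoryObject] = field(default_factory=list)


class StoryBrowser:
    """Holds three tracks and a cursor (track index, object index)."""

    # Hypothetical track names: imported, authored, story in progress.
    TRACKS = ("imported", "authored", "in_progress")

    def __init__(self) -> None:
        self.tracks = {name: [] for name in self.TRACKS}
        self.track_index = 0
        self.object_index = 0

    def objects_on_current_track(self) -> List[StoryObject]:
        # A track is shown as the objects of its stories laid end to end.
        name = self.TRACKS[self.track_index]
        return [obj for story in self.tracks[name] for obj in story.objects]

    def move_track(self, step: int) -> None:
        # Assumed behavior: clamp at the first/last track, reset the cursor.
        last = len(self.TRACKS) - 1
        self.track_index = max(0, min(last, self.track_index + step))
        self.object_index = 0

    def move_object(self, step: int) -> None:
        # Assumed behavior: clamp at the first/last object on the track.
        last = len(self.objects_on_current_track()) - 1
        self.object_index = max(0, min(last, self.object_index + step))

    def selected(self) -> Optional[StoryObject]:
        objs = self.objects_on_current_track()
        return objs[self.object_index] if objs else None


def story_to_xml(story: Story) -> str:
    """Serialize a story to XML using a made-up, illustrative schema."""
    root = ET.Element("story", title=story.title)
    for obj in story.objects:
        node = ET.SubElement(root, "object", image=obj.image)
        for clip in obj.audio_clips:
            ET.SubElement(node, "audio", src=clip)
    return ET.tostring(root, encoding="unicode")
```

For example, after appending a `Story` to `browser.tracks["imported"]`, calling `browser.move_object(1)` advances the selection to the next photo on that track, and `story_to_xml(story)` returns a string that could be written to a file. A SMIL or HTML writer would follow the same pattern with a different element vocabulary.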



FIGURES


FIGURE 7 (reference numeral 700). Labeled blocks: Processor 701; Disk Drive 702; Read-only Memory 703; Input / Output Device 704; Pointing Device 708.



Attorney's Docket No.: 074451.P108 Patent
DECLARATION AND POWER OF ATTORNEY FOR PATENT APPLICATION 



As a below named inventor, I hereby declare that: 

My residence, post office address and citizenship are as stated below, next to my name. 

I believe I am the original, first, and sole inventor (if only one name is listed below) or an original, 
first, and joint inventor (if plural names are listed below) of the subject matter which is claimed and 
for which a patent is sought on the invention entitled 
METHOD AND APPARATUS FOR STORYTELLING WITH DIGITAL PHOTOGRAPHS



the specification of which 

X is attached hereto. 

was filed on as 

United States Application Number 

or PCT International Application Number 

and was amended on 

(if applicable) 

I hereby state that I have reviewed and understand the contents of the above-identified 
specification, including the claim(s), as amended by any amendment referred to above. I do not 
know and do not believe that the claimed invention was ever known or used in the United States of 
America before my invention thereof, or patented or described in any printed publication in any 
country before my invention thereof or more than one year prior to this application, that the same 
was not in public use or on sale in the United States of America more than one year prior to this 
application, and that the invention has not been patented or made the subject of an inventor's 
certificate issued before the date of this application in any country foreign to the United States of 
America on an application filed by me or my legal representatives or assigns more than twelve 
months (for a utility patent application) or six months (for a design patent application) prior to this 
application. 

I acknowledge the duty to disclose all information known to me to be material to patentability as
defined in Title 37, Code of Federal Regulations, Section 1.56.

I hereby claim foreign priority benefits under Title 35, United States Code, Section 119(a)-(d), of any
foreign application(s) for patent or inventor's certificate listed below and have also identified below
any foreign application for patent or inventor's certificate having a filing date before that of the
application on which priority is claimed:

Prior Foreign Application(s)                              Priority Claimed

(Number) (Country) (Day/Month/Year Filed)                 Yes No

(Number) (Country) (Day/Month/Year Filed)                 Yes No

(Number) (Country) (Day/Month/Year Filed)                 Yes No

I hereby claim the benefit under Title 35, United States Code, Section 119(e) of any United States
provisional application(s) listed below:



(Application Number) Filing Date 



(Application Number) Filing Date 



I hereby claim the benefit under Title 35, United States Code, Section 120 of any United States
application(s) listed below and, insofar as the subject matter of each of the claims of this application
is not disclosed in the prior United States application in the manner provided by the first paragraph
of Title 35, United States Code, Section 112, I acknowledge the duty to disclose all information
known to me to be material to patentability as defined in Title 37, Code of Federal Regulations,
Section 1.56 which became available between the filing date of the prior application and the national
or PCT international filing date of this application:

(Application Number) Filing Date (Status - patented, pending, abandoned)

(Application Number) Filing Date (Status - patented, pending, abandoned)

I hereby appoint the persons listed on Appendix A hereto (which is incorporated by reference and a
part of this document) as my respective patent attorneys and patent agents, with full power of
substitution and revocation, to prosecute this application and to transact all business in the Patent
and Trademark Office connected herewith.

Send correspondence to Michael J. Mallie (Name of Attorney or Agent), BLAKELY, SOKOLOFF, TAYLOR &
ZAFMAN LLP, 12400 Wilshire Boulevard, 7th Floor, Los Angeles, California 90025 and direct
telephone calls to Michael J. Mallie (Name of Attorney or Agent), (408) 720-8598.

I hereby declare that all statements made herein of my own knowledge are true and that all
statements made on information and belief are believed to be true; and further that these
statements were made with the knowledge that willful false statements and the like so made
are punishable by fine or imprisonment, or both, under Section 1001 of Title 18 of the United
States Code and that such willful false statements may jeopardize the validity of the
application or any patent issued thereon.

Full Name of Sole/First Inventor Marko Blabanovic

Inventor's Signature Date

Residence Citizenship

(City, State) (Country)

Post Office Address



Full Name of Second/Joint Inventor Lonny L. Chu

Inventor's Signature Date

Residence Citizenship

(City, State) (Country)

Post Office Address



Full Name of Third/Joint Inventor Gregory L. Wolff

Inventor's Signature Date

Residence Citizenship

(City, State) (Country)

Post Office Address



Full Name of Fourth/Joint Inventor

Inventor's Signature Date

Residence Citizenship

(City, State) (Country)

Post Office Address

APPENDIX A



William E. Alford, Reg. No. 37,764; Farzad E. Amini, Reg. No. P42,261; Aloysius T. C. AuYeung, Reg. No.
35,432; William Thomas Babbitt, Reg. No. 39,591; Carol F. Barry, Reg. No. 41,600; Jordan Michael
Becker, Reg. No. 39,602; Bradley J. Bereznak, Reg. No. 33,474; Michael A. Bernadicou, Reg. No. 35,934;
Roger W. Blakely, Jr., Reg. No. 25,831; Gregory D. Caldwell, Reg. No. 39,926; Ronald C. Card, Reg. No.
P44,587; Thomas M. Coester, Reg. No. 39,637; Stephen M. De Klerk, under 37 C.F.R. § 10.9(b); Michael
Anthony DeSanctis, Reg. No. 39,957; Daniel M. De Vos, Reg. No. 37,813; Robert Andrew Diehl, Reg. No.
40,992; Matthew C. Fagan, Reg. No. 37,542; Tarek N. Fahmi, Reg. No. 41,402; James Y. Go, Reg. No.
40,621; James A. Henry, Reg. No. 41,064; Willmore F. Holbrow III, Reg. No. P41,845; Sheryl Sue
Holloway, Reg. No. 37,850; George W. Hoover II, Reg. No. 32,992; Eric S. Hyman, Reg. No. 30,139; Dag
H. Johansen, Reg. No. 36,172; William W. Kidd, Reg. No. 31,772; Erica W. Kuo, Reg. No. 42,775; Michael
J. Mallie, Reg. No. 36,591; Andre L. Marais, under 37 C.F.R. § 10.9(b); Paul A. Mendonsa, Reg. No.
42,879; Darren J. Milliken, Reg. No. 42,004; Lisa A. Norris, Reg. No. P44,976; Chun M. Ng, Reg. No. 36,878;
Thien T. Nguyen, Reg. No. 43,835; Thinh V. Nguyen, Reg. No. 42,034; Dennis A. Nicholls, Reg. No.
42,036; Kimberley G. Nobles, Reg. No. 38,255; Daniel E. Ovanezian, Reg. No. 41,236; Babak Redjaian,
Reg. No. 42,096; William F. Ryann, Reg. No. 44,313; James H. Salter, Reg. No. 35,668; William W. Schaal,
Reg. No. 39,018; James C. Scheller, Reg. No. 31,195; Jeffrey Sam Smith, Reg. No. 39,377; Maria
McCormack Sobrino, Reg. No. 31,639; Stanley W. Sokoloff, Reg. No. 25,128; Judith A. Szepesi, Reg. No.
39,393; Vincent P. Tassinari, Reg. No. 42,179; Edwin H. Taylor, Reg. No. 25,129; John F. Travis, Reg.
No. 43,203; George G. C. Tseng, Reg. No. 41,355; Joseph A. Twarowski, Reg. No. 42,191; Lester J.
Vincent, Reg. No. 31,460; Glenn E. Von Tersch, Reg. No. 41,364; John Patrick Ward, Reg. No. 40,216;
Charles T. J. Weigell, Reg. No. 43,398; Kirk D. Williams, Reg. No. 42,229; James M. Wu, Reg. No.
P45,241; Steven D. Yates, Reg. No. 42,242; Ben J. Yorks, Reg. No. 33,609; and Norman Zafman, Reg.
No. 26,250; my patent attorneys, and Andrew C. Chen, Reg. No. 43,544; Justin M. Dillon, Reg. No.
42,486; Paramita Ghosh, Reg. No. 42,806; and Sang Hui Kim, Reg. No. 40,450; my patent agents, of
BLAKELY, SOKOLOFF, TAYLOR & ZAFMAN LLP, with offices located at 12400 Wilshire Boulevard, 7th
Floor, Los Angeles, California 90025, telephone (310) 207-3800, and James R. Thein, Reg. No. 31,710,
my patent attorney.

APPENDIX B

Title 37, Code of Federal Regulations, Section 1.56
Duty to Disclose Information Material to Patentability

(a) A patent by its very nature is affected with a public interest. The public interest is best served,
and the most effective patent examination occurs when, at the time an application is being examined, the
Office is aware of and evaluates the teachings of all information material to patentability. Each individual
associated with the filing and prosecution of a patent application has a duty of candor and good faith in
dealing with the Office, which includes a duty to disclose to the Office all information known to that individual
to be material to patentability as defined in this section. The duty to disclose information exists with respect
to each pending claim until the claim is cancelled or withdrawn from consideration, or the application becomes
abandoned. Information material to the patentability of a claim that is cancelled or withdrawn from
consideration need not be submitted if the information is not material to the patentability of any claim
remaining under consideration in the application. There is no duty to submit information which is not material
to the patentability of any existing claim. The duty to disclose all information known to be material to
patentability is deemed to be satisfied if all information known to be material to patentability of any claim
issued in a patent was cited by the Office or submitted to the Office in the manner prescribed by §§ 1.97(b)-(d)
and 1.98. However, no patent will be granted on an application in connection with which fraud on the Office
was practiced or attempted or the duty of disclosure was violated through bad faith or intentional misconduct.
The Office encourages applicants to carefully examine:

(1) Prior art cited in search reports of a foreign patent office in a counterpart application, and

(2) The closest information over which individuals associated with the filing or prosecution of a
patent application believe any pending claim patentably defines, to make sure that any material information
contained therein is disclosed to the Office.

(b) Under this section, information is material to patentability when it is not cumulative to
information already of record or being made of record in the application, and

(1) It establishes, by itself or in combination with other information, a prima facie case of
unpatentability of a claim; or

(2) It refutes, or is inconsistent with, a position the applicant takes in:

(i) Opposing an argument of unpatentability relied on by the Office, or

(ii) Asserting an argument of patentability.

A prima facie case of unpatentability is established when the information compels a conclusion that a claim is
unpatentable under the preponderance of evidence, burden-of-proof standard, giving each term in the claim
its broadest reasonable construction consistent with the specification, and before any consideration is given to
evidence which may be submitted in an attempt to establish a contrary conclusion of patentability.

(c) Individuals associated with the filing or prosecution of a patent application within the
meaning of this section are:

(1) Each inventor named in the application;

(2) Each attorney or agent who prepares or prosecutes the application; and

(3) Every other person who is substantively involved in the preparation or prosecution of the
application and who is associated with the inventor, with the assignee or with anyone to whom there is an
obligation to assign the application.

(d) Individuals other than the attorney, agent or inventor may comply with this section by
disclosing information to the attorney, agent, or inventor.